Are People Successful at Learning Sequential Decisions on a Perceptual Matching Task?
نویسندگان
چکیده
Sequential decision-making tasks are commonplace in our everyday lives. We report the results of an experiment in which human subjects were trained to perform a perceptual matching task, an instance of a sequential decision-making task. We use two benchmarks to evaluate the quality of subjects’ learning. One benchmark is based on optimal performance as defined by a dynamic programming procedure. The other is based on an adaptive computational agent that uses a reinforcement learning method known as Q-learning to learn to perform the task. Our analyses suggest that subjects learned to perform the perceptual matching task in a near-optimal manner at the end of training. Subjects were able to achieve near-optimal performance because they learned, at least partially, the causal structure underlying the task. Subjects’ learning curves were broadly consistent with those of model-based reinforcementlearning agents that built and used internal models of how their actions influenced the external environment. We hypothesize that, in general, people will achieve near-optimal performances on sequential decision-making tasks when they can detect the effects of their actions on the environment, and when they can represent and reason about these effects using an internal mental model.
منابع مشابه
Are People Successful at Learning Sequences of Actions on a Perceptual Matching Task?
We report the results of an experiment in which human subjects were trained to perform a perceptual matching task. Subjects were asked to manipulate comparison objects until they matched target objects using the fewest manipulations possible. An unusual feature of the experimental task is that efficient performance requires an understanding of the hidden or latent causal structure governing the...
متن کاملComparing Bandwidth and Self-control Modeling on Learning a Sequential Timing Task
Modeling is a process which the observer sees another person's behavior and adapts his/her behavior with that which is the result of interaction. The aim of present study was to investigate and compare effectiveness of bandwidth modeling and self-control modeling on performance and learning of a sequential timing task. So two groups of bandwidth and self-control were compared. The task was pres...
متن کاملShort-term gains, long-term pains: how cues about state aid learning in dynamic environments.
Successful investors seeking returns, animals foraging for food, and pilots controlling aircraft all must take into account how their current decisions will impact their future standing. One challenge facing decision makers is that options that appear attractive in the short-term may not turn out best in the long run. In this paper, we explore human learning in a dynamic decision making task wh...
متن کاملEffects of cognitive functions on feedback request strategy and learning of a perceptual motor task
Taking individuals' cognitive abilities into consideration can play an important role in the initial stages of learning motor skills. So, the purpose of the present study was to investigate the effect of cognitive functions on feedback request strategy and learning of a perceptual motor task. A number of 60 university male students with a mean age of 22/4 years (SD = 1/99) were selected through...
متن کاملRapid decisions from experience.
In many everyday decisions, people quickly integrate noisy samples of information to form a preference among alternatives that offer uncertain rewards. Here, we investigated this decision process using the Flash Gambling Task (FGT), in which participants made a series of choices between a certain payoff and an uncertain alternative that produced a normal distribution of payoffs. For each choice...
متن کامل